Split support and split conflict randomization tests in phylogenetic inference.

Author

  • M Wilkinson
Abstract

Randomization tests allow the formulation and statistical testing of null hypotheses about the quality of entire data sets or the quality of fit between the data and particular phylogenetic hypotheses. Randomization tests of phylogenetic hypotheses based on the concepts of split support and split conflict are described here, as are tests where splits, rather than the data, are randomly permuted. These tree-independent randomization tests are explored through their application to phylogenetic data for caecilian amphibians. Of these tests, split support randomization tests appear to be the most promising tools for phylogeneticists. These tests seem quite conservative, are applicable to nonpolar data and unordered multistate characters, and do not have the problems of nonindependence that affect split conflict and hierarchy tests. Unlike split conflict tests, their power does not appear to be correlated with split size. However, all tests are sensitive to taxonomic scope. Split support tests may help identify data that are likely to be affected by long-branch effects. Comparison of test results for mutually incompatible splits may help identify the presence of strong misleading signals in phylogenetic data. Significant split support could be a prerequisite for considering phylogenetic hypotheses to be well supported by the data, and split support randomization tests might be usefully applied prior to or as part of tree construction.
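
The general shape of a data-permutation test for one split can be sketched as follows. This is a minimal illustration and not the procedure from the paper: the function names (`split_support`, `permutation_p_value`), the data layout, and above all the support measure are assumptions. Here a binary character is taken to "support" a split only when its two state classes coincide exactly with the two sides of the split, and the null distribution is built by shuffling each character's states independently across taxa.

```python
import random


def split_support(characters, split):
    """Count binary characters whose state classes match the split exactly.

    characters: list of sequences, one per character, holding a 0/1 state
                for each taxon (taxa are identified by position).
    split:      set of taxon indices on one side of the split.
    NOTE: this exact-match definition of support is a simplifying assumption.
    """
    side = frozenset(split)
    count = 0
    for states in characters:
        ones = frozenset(i for i, s in enumerate(states) if s == 1)
        zeros = frozenset(i for i, s in enumerate(states) if s == 0)
        if side == ones or side == zeros:
            count += 1
    return count


def permutation_p_value(characters, split, n_perm=999, seed=0):
    """Estimate how often permuted data support the split as well as the real data."""
    rng = random.Random(seed)
    observed = split_support(characters, split)
    hits = 0
    for _ in range(n_perm):
        permuted = []
        for states in characters:
            shuffled = list(states)
            rng.shuffle(shuffled)  # permute states across taxa, per character
            permuted.append(shuffled)
        if split_support(permuted, split) >= observed:
            hits += 1
    # Count the observed data as one member of the null sample.
    return observed, (hits + 1) / (n_perm + 1)


# Hypothetical toy matrix: 6 taxa, 6 binary characters.
toy = [(1, 1, 1, 0, 0, 0)] * 5 + [(1, 0, 1, 0, 1, 0)]
observed, p = permutation_p_value(toy, {0, 1, 2}, n_perm=199, seed=1)
print(observed, p)
```

Shuffling each character separately keeps the number of taxa showing each state fixed while breaking any covariation among characters, which is the kind of null hypothesis the abstract describes for tests in which the data are permuted.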

Similar resources

Evolutionary relationships of human populations on a global scale.

Using gene frequency data for 29 polymorphic loci (121 alleles), we conducted a phylogenetic analysis of 26 representative populations from around the world by using the neighbor-joining (NJ) method. We also conducted a separate analysis of 15 populations by using data for 33 polymorphic loci. These analyses have shown that the first major split of the phylogenetic tree separates Africans from ...


Prediction of soil cation exchange capacity using support vector regression optimized by genetic algorithm and adaptive network-based fuzzy inference system

Soil cation exchange capacity (CEC) is a parameter that represents soil fertility. Being difficult to measure, pedotransfer functions (PTFs) can be routinely applied for prediction of CEC by soil physicochemical properties that can be easily measured. This study developed the support vector regression (SVR) combined with genetic algorithm (GA) together with the adaptive network-based fuzzy infe...


Minimum conflict: a divide-and-conquer approach to phylogeny estimation

MOTIVATION Fast and reliable phylogeny estimation is rapidly gaining importance as more and more genomic sequence information is becoming available, and the study of the evolution of genes and genomes accelerates our understanding in biology and medicine alike. Branch attraction phenomena due to unequal amounts of evolutionary change in different parts of the phylogeny are one major problem for...


Taxon Selection under Split Diversity.

The "phylogenetic diversity" (PD) measure of biodiversity is evaluated using a phylogenetic tree, usually inferred from morphological or molecular data. Consequently, it is vulnerable to errors in that tree, including those resulting from sampling error, model misspecification, or conflicting signals. To improve the robustness of PD, we can evaluate the measure using either a collection (or dis...


Inferring the higher-order phylogeny of mosses (Bryophyta) and relatives using a large, multigene plastid data set.

PREMISE OF THE STUDY Investigating the early diversification of major clades requires well-corroborated and accurate phylogenetic inferences. We examined the performance of a large set of plastid genes for inferring the broad phylogenetic backbone of mosses-the second largest major clade of land plants-and their nearest relatives. METHODS We surveyed 14-17 plastid genes from a broadly represe...



Journal:
  • Systematic biology

Volume 47, Issue 4

Pages: -

Publication date: 1998